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Reptation governs motion of long polymers through a confining en- 
vironment. Slack enters at the ends and diffuses along the polymer 
czj ' as stored length. The rate at which stored length diffuses limits the 
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speed at which the chain can drift. This paper relates the rate of 
stored length diffusion to the conformation of the tube within which 
the polymer is confined. In the scaling limit of long polymer chains 
and weak applied electric fields, holding the product of polymer 
length times field finite, the tube length and stored length density 
O \ take on their zero- field values. The drift velocity then depends only 

on the the polymer's end-to-end separation in the direction of the 
field. 
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1 Introduction 



Gel electrophoresis separates charged polymers, such as sections of DNA, ac- 
cording to their length. Applied electric fields exert a uniform force per unit 
length parallel to the field. The gel contains pores and fibers which tend to 
entrap the long polymers within tubes [1] (see figure 1), impeding their drift. 
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Significant motion of the polymer occurs by the transport of stored length 

through the tube, a process known as reptation [2]. As a result of entanglement 

of the polymer in the gel, the polymer develops a complicated dependence of 

drift velocity on electric field £ and total polymer length C = l p N. We denote 

the polymer persistence length (typically of order 150-300 base pairs for DNA) 

as l P , and call N the "chain length" of the polymer. 

In the limit of weak field, the velocity is proportional to the electric force 
through the Nernst-Einstein relation 

* = ^(«jre)- (i) 

Here (qNS) is the electric force on the polymer, with q the charge within 
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Fig. 1. Polymer (solid curve) entangled in gel (squares) confined to tube (shaded 
area). Dots spaced at intervals of l p are called reptons. 

a persistence length. The diffusion constant D depends on the chain length 
according to the prediction of de Gennes [2] 



D ~ N~ 2 . 



(2) 



Comparing equations (1) and (2), the weak field drift velocity varies inversely 
with the chain length. Short chains travel more quickly than long chains, and 
chains may be sorted according to length based on their travel time across the 
gel, or their travel distance within a given time. 

Loss of length resolution, a serious impediment to electrophoresis as a sepa- 
ration tool, occurs for any polymer above a certain length. For DNA typical 
limits [3] are of order 50 Kbp, with chain length N of order 150-300. Increasing 
these limits requires reducing the electric field, thus increasing the run time, 
or using more delicate gels. Pulsed-field [3] and other techniques push the 
threshold length out yet further, reaching DNA lengths of order 10 7 bp, with 
chain length N of order 10 5 . From a mathematical point of view, loss of length 
resolution implies a breakdown of the Nernst-Einstein relation between drift 
velocity and zero-field diffusion constant. Understanding this problem requires 
further investigation of the functional form of the drift velocity Vd(N,S). 

Length and field dependence of the drift velocity have been well studied ana- 
lytically for biased reptation models [4]. These models assume constant tube 
length and conformation, independent of the applied electric field and inde- 
pendent of time. They conclude that diffusion of stored length depends only on 
the polymer tube length and end-to-end distance. Their dynamics is random 
polymer reptation biased by the applied electric field. Rubenstein [5] intro- 
duced what is now called the "repton model" (see figures 1 and 2), a lattice 
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Fig. 2. Repton model in the (n; s) representation. The numbers along the chain 
denote rtj, while the ± signs define Si . This configuration represents the polymer 
displayed in figure 1. The horizontal axis labels gel pores in sequence along the tube. 
The horizontal axis represents the height hi with respect to the first pore i = 0. In 
this figure L=ll, S=+3, JV =7, N=19 

model that dynamically generates its own tube. Later Duke [6] generalized 
the repton model to include the electric field. Semenov, Duke and Viovy [7] 
introduced fluctuations directly into the biased reptation model. 

The Repton model was studied intensively by B. Widom and co-workers [8-10], 
and by van Leeuwen and co-workers [11-13], among others. It is convenient to 
introduce fundamental length and frequency parameters a and u>, and to define 
a reduced electric field E = \qE\ajksT . Numerical simulations by Barkema, 
Marko and Widom [9] suggest a simple scaling behavior for the drift velocity, 
with 



—v d (N,E)^J(NE) 



(3) 



in the scaling limit 

E -^0, 



N -> oo, 



u = NE finite 



(4) 



They suggest an approximate functional form for the scaling function 

J{u) 




(5) 



This scaling function exhibits loss of length resolution because at fixed fields 
the drift velocity varies as E\E\, independent of N for long chain lengths. 
Barkema, Caron and Marko [14] discovered similar scaling relations for real 
DNA fragments and electrophoresis gels. 

The repton model decomposes the polymer chain into iV units called reptons, 



each incorporating one persistence length of the chain. When neighboring rep- 
tons on the chain lie within the same gel pore a unit of stored length separates 
them. Otherwise the reptons must occupy neighboring gel pores, because the 
bond between reptons prevents further excursions. Label the occupied gel 
pores with index % = 0, 2, ..., L. The bond from one pore % — 1 to the neigh- 
boring pore % we denote Si = ±1, where i — 1,2, ...L. The direction of pore 
labeling % is chosen as follows: the "head" of the polymer is the end that is 
furthest advanced in the direction of applied force. The other end is the "tail" . 
Labeling begins at the tail and runs to the head. As the polymer drifts, head 
and tail may occasionally interchange. The set of all bonds s defines the shape 
of the polymer's tube. 

Reptons may move along the tube when attached to at least one unit stored 
length. Stored length is recorded in the variable rii for the units of stored 
length in gel pore %. The set of stored length occupancy data for all gel pores 
is denoted n. The total amount of stored length, summed over gel pores, is 
iVo=N-l-L. When a repton moves it carries a unit of stored length from the 
pore which it left into the new pore. The potential energy of the polymer 
changes by Esi when a unit of stored length moves from pore i — 1 to i. The 
center of mass of the polymer advances by Sid/N. Allowed moves occur with 
frequency uB Si where the Boltzmann factor B = e E l 2 . The electric field favors 
either left- or right-wards motion depending on the sign of S{. 

The tube constantly changes shape while stored length migrates across the 
polymer. However, the changes in tube conformation for short times occur in 
a small length around the ends of the polymer. The time for stored length 
to traverse the tube is small compared to the time required for significant 
change in polymer conformation. This is clearly the case for zero field diffusion, 
because de Gennes [2] showed that the time required for tube renewal tr grows 
as iV 3 , while the diffusion of a unit of stored length takes the usual r$ ~ N 2 
time to travel a distance N. Thus the tube renewal time exceeds the time for 
stored length relaxation by a factor N. 

The factor of N, by which tr exceeds t$ persists for finite fields. Each unit of 
stored length that traverses the tube advances the polymer by one gel pore. 
Generation of a new tube therefore requires L units of stored length to diffuse 
across the polymer. Tube length L is proportional to chain length N. 

Examine the distance from each end of the polymer over which diffusion of 
stored length is sensitive to fluctuations in tube conformation. Denote this 
length scale l c , and note that the time scale for stored length diffusion across 
this length is I 2 . The time needed for renewal of this length of tube varies as Z 3 
because l c units of tube length must diffuse a distance of order l c . The length 
at which these time scales are comparable defines l c , of order 1. Further from 
the ends of the tube, the tube is effectively stationary while stored length 



diffuses. More information about the coupling of stored length diffusion to 
chain conformation near the chain ends may be found in reference [12]. 

We use the idea of fixed tube conformation on the time scale of stored length 
diffusion to simplify the analytic solution of the repton model with free bound- 
aries. By appropriately adjusting the dynamics at the free ends, we achieve 
closed form expressions for the drift velocity as a function of tube confor- 
mation. These expressions reproduce the drift velocity of the original repton 
model in the limit of long polymer chains. 



2 Stored length diffusion 



The joint probability distribution P(n; s) defines the probability for tube con- 
formation s and stored length distribution n within the tube. This function 
obeys the steady-state master equation 

J2 W(nV -»■ ns)P(n'; s') = J2 W ( ns ~* n's')P(n; s) (6) 



The left hand side is the rate of transitions into the state (n; s), while the right 
hand side is the rate of transitions out of that state. Approximate solution of 
this master equation follows the observation that stored length diffuses quickly 
compared to changes in tube conformation, except within distances of order 
1 of the tube ends. Fluctuations at the tube endpoints have negligible impact 
on the stored length probability distribution away from the tube ends. 

We replace the original free boundary repton model described by the master 
equation (6) with a simpler model in which the tube conformation s is held 
fixed. The simpler master equation 
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B ai P(--;m- 1 + i,m-i,---,s)] (7) 
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P(ms). 



may be solved exactly. The first term on the left describes the flux of stored 
length from gel pore % + 1 into i. The second term describes flux from % — 1 
to i. The right-hand side of equation (7) describes flux out of site i. The 
summations include as-yet undefined functions of stored length occupation 
and bond orientation just beyond each end-point of the tube. We define the 
probability function P(n; s) to be independent of the variables n_i and n L+ i, 



and we introduce fictitious bonds so = sl+i = 0. This is equivalent to placing 
the tube in contact with a reservoir of freely available stored length at each 
end. 

Our simpler model is identical to the periodic-boundary repton model studied 
by Van Leeuwen and Kooiman [11] with the exception of the boundary con- 
dition. The drift velocity of sufficiently long chains is not influenced by our 
alteration of the endpoint dynamics because it is diffusion of stored length 
along the interior of the tube that limits the drift velocity. The solution to 
equation (7) yields the stored length distribution within a tube of fixed con- 
formation s, but no information on the probability distribution for tube con- 
formations. Equation (7) has a well known solution [15] 



P(n;s) = nP?*- 

i=0 



Formally we extend this solution beyond the tube endpoints by defining 

P-i = Pl+i = 1 (9) 



in order that P(n; s) not depend on n_i and Ul+i- The form of solution (8) 
with free ends is identical to the solution with periodic boundary condi- 
tions [11,15], because the master equation is a local difference equation. 

The analysis that follows is largely adapted from the studies of the periodic 
boundary repton problem, and the free boundary problem in weak fields, by 
Van Leeuwen and co-workers [11-13]. Our approach is exact in the large N 
limit for the free end repton model in arbitrary electric field. The factors p, 
obey the single-particle master equation 

B~ Si+1 p i+1 + B Si Vi _ x = \B S ^ + B- s '} Pl . (10) 



The single-particle master equation (10) is second order, so the solution de- 
pends on two independent constants of integration. 

One constant of integration is the flux of stored length through each bond 

c{s)= Pi ^B Si - Pi B- s \ (11) 



Although the left hand side of equation (11) formally depends on i, the value 
of c is independent of i. The boundary condition p_i = 1, governing the avail- 
ability of stored length at chain ends, sets the other constant of integration. 



The full solution is 



Pi = p^B 211 * -cJ2 B 2{h ^ hk)+Sk (12) 

fc=0 



where we define 



hi = 2 s i ( 13 ) 

i=i 

the "height" of the tube in the direction parallel to the field, as indicated in 
figure 2. 

The form of solution (12) is interesting. The first term is the "barometric" dis- 
tribution of stored length, the expected equilibrium distribution in the absence 
of flux. An equivalent distribution occurs in the original repton model, when 
one repton is held fixed so the polymer cannot drift [16]. The second term cor- 
rects for the steady flow of particles. Since the boundary conditions (9) apply 
at both ends of the chain, the stored length flux c(s) is determined. Setting 
Pl+i = 1 in equation (12), we solve for 

B 2S -1 

c(s) = Efe±o 1 ^ 2(s -' lfc)+Sfc ' (M) 

In this expression, we define 

S = J>- (15) 

1=1 

as the end-to-end separation parallel to the field E. This quantity equals the 
height at the end of the tube. 

Given the probability distribution P(n; s) we calculate the tube-dependent 
velocity 

v(s) = J2 ^(n; s)v(n; s)/ ]T P(n; s). (16) 



Here the the imbalance of forward and backward transition rates for stored 
length within the tube governs the velocity associated with a particular allo- 
cation of stored length, 

W (n;s) = ^J20( ni )(s i+1 B S ^ - 8i B~'*). (17) 

iV i=0 



We include the natural velocity scale for the problem au/N, with a/N the 
center of mass displacement parallel to the field for each repton jump, and uj 
the jump attempt frequency. Once we know the velocity for arbitrary tubes, 
we approximate the full drift velocity Vd as an average of v (s) over tube con- 
formations s. 

It is convenient to interchange the summations over n in equation (16) with 
the sum over i in equation (17). Then 

v(s) = ^J29 l (s l+1 B s ^-s l B^) (18) 

iy i=0 



where 9i is the average of Q{nj) over the allocation of stored length 

£ = ]TP(n; 8)0(71,)/ £P(n;s). (19) 

n n 

Using the exact solution for P(n; s) equation (8), 

n j=0 n j=0 

The theta function picks out terms that contain at least one unit of stored 
length in gel pore i. 

Removing the obvious factor of Pi from the numerator of equation (20), the 
remaining factors are partition functions for the distribution of stored length 
among gel pores, 

L L 

q( n s)=y,Hp7 with Y, n J= N s- (2i) 

n j=0 j= 

The sums in equation (20) allocate N s = No units of stored length in the 
denominator but only N s = Nq — 1 units in the numerator. The ratio 

z(s) = Q(N - 1)/Q(N ) (22) 

is the fugacity for stored length. Hence 

0i = Piz(s) (23) 



is the explicit solution of equation (19). 



Substitute expression (23) for 9i into equation (18) for v(s). Utilizing the 
fictitious bonds Sq = Sl + i = to shift the summation index on the first term 
of equation (18), yields 

"00 = ^P E(Pi-iB Si - Pi B-^ Si . (24) 

iV i=0 



The quantity in parentheses above is the single particle flux defined in equa- 
tion (11). In particular, it is independent of i, and may be brought outside the 
sum. Finally, we obtain our principal result 

w N v ' 



This equation is fully general, holding for any field and any tube conformation. 
The relationship of v(s) to the repton model drift velocity Vd depends on the 
limit of long chains. We conjecture the average over all tubes s of v(s) equals 
the drift velocity Vd with corrections of order 1/iV relative to Vd caused by our 
fixed-tube boundary conditions. 



3 Scaling limit 



In the scaling limit (4) the electric field vanishes as 1/N, leading to significant 
simplifications. For weak fields, we simplify equation (14) for the single-particle 
flux by expanding in powers of E 

c(s) = J^L + 0(E*S 2 /L) (26) 



Interestingly, this value of c exactly counterbalances the barometric term in 
equation (27), ensuring that the first order term in pi grows only as EN 1//2 
even if the end-to-end distance S grows proportionally to N. It inhibits the 
tendency of stored length to accumulate at favorable positions on the chain. 
The simple form for the flux c arises because SiE is the force along any link 
between gel pores, so SE is the net force along the tube, and SE/L the net 
force per link. 

Simplify equation (12) for Pi by expanding 

Pi = 1 + Ehi - (i + l)c + 0{E 2 hj). (27) 
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Then substitute pi = 1 + O(E) into the partition functions Q defined in 
equation (21) to obtain the fugacity 



" iS)= N^L +0iES/L) <* 



with N + L = N - 1. 



Inserting equation (26) for flux c(s) and equation (28) for fugacity z(s) into 
equation (25) for velocity, we obtain the expression 

aooN S 2 E 
v(s) = —^^ (29) 



with corrections expected to fall off as 1/N in the scaling limit. Van Leeuwen 
and coworkers derived similar equations for the repton model in weak fields. 
Similar expressions are also known for biased reptation models [4,7]. We con- 
jecture that equation (29) is exact for the repton model with free boundary 
conditions in the scaling limit for all values of the scaling variable u = NE. 
We make a further conjecture, that equation (29) gives the drift velocity v^ 
with N , L and S set to their average values. 

One immediate result of expression (29) is the zero-field diffusion constant. 
Since S 2 = L = 2N/3 at zero field, the weak-field drift velocity is simply 
auj(NE)/3N 2 , in agreement with the accepted diffusion constant D = 1/3N 2 . 
Hence our conjectures hold at least at u — 0. 



4 Numerical study and discussion 



To test our conjecture that equation (29) holds throughout the scaling limit (4) 
we calculate averages of L, S 2 and v(s)/va for various values of the scaling 
variable u = NE, and then extrapolate to large chain lengths N. We test the 
conjecture for moderate and large u — 1 and 10. For short chain lengths, N=3- 
7 we perform exact calculations based on the matrix approach described in 
reference [8]. Because the matrix dimensionality grows as 3^ it is impractical to 
extend these calculations to large N. Instead, we perform numerical simulation 
of drift using the multi-spin simulation program of Gerard Barkema [9]. During 
these simulations we monitor S 2 and L in addition to the drift velocity. 

Figures 3 and 4 display our results for L/(N — 1) and S 2 /(N — 1), and the ratio 
v(s)/vd calculated by inserting average values of L and S 2 into equation (29) 
for the velocity in weak fields. We divide L and S 2 by N-l, because the finite 
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Fig. 3. Moderate field NE = 1 data. L/(N-1), S 2 /(N-1) and v(s)/v d are shown, 
respectively, in parts a, b and c. 11 
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Fig. 4. Strong field NE = 10 data. L/N, S 2 /N, and v(s)/vd are shown, respectively, 
in parts a, b and c. 12 



size correction then vanishes in the limit u = NE — »■ 0. Data for small N=3-7 
are exact matrix calculations, while data for large N are simulated. 

It appears that L/(N—1) — > 2/3 for any value of the scaling variable u. Dashed 
lines in figures 3a and 4a indicate the value 2/3. The finite size correction 
falls off proportionally to u/N. Our analysis of the scaling limit in section 
(3) explains these results. Assuming that S 2 equals N times a function of 
u, equation (28) predicts the fugacity takes its zero-field value with finite-size 
corrections of order E = u/N. The stored length density N /L, which depends 
on the fugacity, inherits the zero-field value and u/N finite-size correction and 
in turn passes them on to L/(N — 1). 

In contrast, the asymptotic value of S 2 depends on the scaling variable u. Vari- 
ation of the end-to-end separation causes the entire functional dependence of 
drift velocity on E and N, because N and L are constants in the scaling 
limit. References [7] and [9] predict a cigar-shape polymer conformation for 
large values of u, with elongation parallel to the field of order N\E\ 1 ' 2 . Assum- 
ing that the head and tail of the polymer lie near the top and bottom of this 
cigar shape, we identify the elongation with our variable S. From equation 
(29) it follows that the large u drift velocity varies as E\E\ independent of N, 
consistent with the large u form of J(u). 

Examining parts c of figures 3 and 4, we see that in the large N limit, the 
velocity predicted by the weak field limit (29) converges to the actual simulated 
velocity. The large N behavior is consistent with the ratio of equation (29) to 
simulated velocity approaches 1 as 1/N a with the power a falling between 1/2 
and 1. This holds for small, moderate and large values of the scaling variable, 
because finite-size and -field corrections vanish in the scaling limit. 

In conclusion, we present a general formula (25) for the drift velocity of any 
tube conformation in any electric field. The simpler result (29) applies through- 
out the scaling limit of long chains at fixed u = NE. Our approach works 
because diffusion of stored length decouples from fluctuations of chain con- 
formation except over a negligible length near the ends of the chain. The 
velocity is governed entirely by the end-to-end separation of the tube S. Tube 
fluctuations [7] are important only because they permit dependence of S 2 on 
u. 
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